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ABSTRACT 

We present an analysis of the four-year data from the COBE DMR experiment. We use a 
Karhunen-Loeve expansion of the pixel data to calculate the normalization and goodness-of-fit 
of a range of models of structure formation. This technique produces unbiased normalization 
estimates and is capable of determining the normalization of any particular model with 
a statistical uncertainty of 7%. We present a parameterization of the normalization and 
likelihood function which summarizes our results for a wide range of models. We use the COBE 
normalization to compute small-scale fluctuation amplitudes for a variety of models and discuss 
the implications of these results for theories of large-scale structure. 

Subject headings: cosmology: theory — cosmic microwave background — large-scale structure of 
universe 
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Introduction 



The COBE DMR experiment has now been 



completed (Bcnnnett et aL 199t, Gorski et 



al. 1996| , [Hinshaw et aL 1996| , [Banday et al. 1996| ), 
and the four-year sky maps that were produced 
are the last word on large-angle cosmic microwave 
background (CMB) anisotropics that we are likely 
to have for some time. The impact of these 
data on models of structure formation has been 
immense. By providing a theoretically clean 
measure of the mass fluctuations in the linear 
regime, the COBE data has allowed for the first 
time a < 10% determination of the amplitude of 
the power spectrum of cosmological models. This 
normalization supersedes the previous best method 
of normalization based on the abundance of clusters; 
alternatively, the two normalizations together 
constrain one extra combination of cosmological 
parameters. The combination of cluster abundances 
and the COBE normalization on large-scales has 
definitively ruled out the "standard" Cold Dark 
Matter (CDM) model with adiabatic, scale-invariant 
initial fluctuations. For more discussion on the 
impact of the COBE data on large-scale structure 



theories, see ( [White fc Scott 1996a| ). 



The COBE data determine the amplitude of 
the fluctuation spectrum at large scales with a 
statistical error of 7%. In order to fully exploit this 
high-precision determination one must go beyond 
the simple Sachs- Wolfe ( ^achs fc Wolfe 1967| ) 
approximation to the large-angle anisotropy 
spectrum and a simple summary of the COBE 
data such as the rms fluctuation or the correlation 
function. In this paper we present normalizations 
of several popular CDM based models of structure 
formation by performing a maximum-likelihood fit of 
numerically computed anisotropy spectra directly to 
the COBE pixel data. Tests using Monte-Carlo data 
sets indicate that our maximum-likelihood estimates 
are unbiased and that the maximum-likelihood 
points provide "good" fits to the data. We quote 
our results for both the radiation and matter power 
spectra at large scales, which in any particular model 
have a definite relationship. We also discuss how one 



computes the normalization on smaller scales from 
our results. 

The outline of the paper is as follows. In §^ we 
discuss the method used to perform the maximum- 
likelihood fits to the COBE data, which is based 
on the Karhunen-Loeve (KL) transform. In this 
section we also present results of our Monte-Carlo 
simulations to test for bias in the fitting method. In 
^ we present two different frequentist methods for 
using the COBE data to constrain models, and we 
contrast these methods with the Bayesian analysis 
used in the rest of the paper. The normalization 
of the radiation power spectra for models in the 
CDM family is considered in where we also 
provide fitting functions for a class of models with 
approximately quadratic anisotropy spectra. For 
open (OCDM) and cosmological constant (ACDM) 
cold dark matter models we give likelihood functions 
vs. flo from the COBE data. In §|| we use the 
results of §| to find the amplitude of the matter 
power spectrum for a wide range of CDM models. 
After describing some implications of the COBE 
normalization for models of large-scale structure in 
^ we present our conclusions in 



2. Likelihood Analysis 

2.1. Notation 

The normalizations and likelihoods presented in 
this paper are based on an analysis of the four-year 



COBE DMR sky maps ( Bcnnnett et al. 1996 ). As 
discussed in (Bunn, Scott & White 199L, Banday et 



al. 1994 , White fc Bunn 1995 ) there is much more 
information in the COBE DMR data than simply the 
rms fluctuation, so detailed fitting of a theory to the 
pixel data is necessary to obtain all of the information 
available from this data set. We analyze the data by 
performing a Karhunen-Loeve (KL) transform, i.e., 
expanding the data in a set of modes that optimally 
retain signal and throw away noise. This method 
is also known as the signal-to-noise eigenmode 
analysis, or optimal subspace filtering. For further 



information on this method, see (Bond 1995, Bunn 
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& Sugiyama 1 995, [Bunn, Scott fc White 1995|, [White 
fc Bunn 1995| , |Bunn 1995| , [Vogeley fc Szalay 1996| ). 



Let us start by describing the hkehhood analysis 
of the COBE DMR pixel data. As usual, we expand 
the CMB temperature anisotropy AT/T in spherical 
harmonics: 



AT 



.(f)- 



(1) 



The COBE data have been shown to be consistent 



with Gaussian fluctuations (Kogut et al. 1996a) such 
as are predicted by the simplest inflationary theories 
and by most defect theories (on large scales). We 
will only consider Gaussian theories in this paper, 
hence each aim will be an independent Gaussian 
random variable of zero mean. Thus all of the 
cosmological information in the CMB is contained in 
the variances 

Ci = {\aim\^). (2) 

The quantities {Cg} are collectively referred to as 
the angular power spectrum. Our goal in analyzing 
the DMR data is to test hypotheses about the 
angular power spectrum. 

We work with a single DMR sky map consisting 
of a weighted average of the two 53 GHz maps and 
the two 90 GHz maps, in the ecliptic pixelization. 
We weight the maps according to the inverse of the 
noise variance in each pixel, in order to have minimal 
noise in the resulting map. Under the assumption of 
Gaussian statistics, this is mathematically equivalent 
to performing a multi-map likelihood analysis. (We 
do not use the 31 GHz maps for fear of Galactic 
contamination; in any case, the noise levels in these 
maps are high enough that little information is lost 
by not using them.) We remove all pixels that lie 



within the "custom cut" described in (Bennnett et 



al. 1996 ), leaving iVpix = 3890 pixels. In addition, 
we remove a best-fit monopole and dipole. This last 
step is formally unnecessary, since our likelihood 
analysis is explicitly forced to be insensitive to the 
monopole and dipole. 

Let di be the temperature measurement 
corresponding to the ith pixel, and let d be the iVpix- 
dimensional vector (di , . . . , dj^ ) . (Throughout 



this paper, we will denote vectors in real three- 
dimensional space by boldface type, and vectors in 
other spaces such as "pixel space" by arrows.) The 
datum di contains contributions from both signal 
and noise: 



AT 



(3) 



Here W{r, f') represents the window function of the 



experiment (Wright et al. 1994), the star denotes 



convolution, is the location on the sky of the ith 
pixel and n, is a random variable representing the 
noise. The DMR noise is approximately Gaussian, 
and noise correlations from pixel to pixel are weak 



( Lineweaver et al. 1994 ). The noise covariance 
matrix 

N = (nn'^) 



(4) 



is therefore approximately diagonal. 



Since the underlying, theoretically uncorrelated 
variables are specified in £ space we wish to write 
d in terms of the For notational convenience 

we will denote a pair of indices {£m) by a single 
Greek index such as /i. The correspondence between 
the two is fj, = £{£ 4- 1) -I- m 4- 1 so that fj, ranges 
from 1 to cx) as and m vary over their entire 
allowed ranges. We introduce an A'pix x oo matrix 
Y whose elements are — Y^{ri). The most 
natural way to represent the window function is by 
a diagonal matrix VF^j, — WiS^y, where £ is the 
index corresponding to /i and is the Legendre 
polynomial expansion of the window function. Then 
we can write 

d = YWd+n. (5) 

Here a and n are the vectors whose components are 
and respectively. 

The covariance matrix, C^j/, of a is diagonal, 
and its nonzero elements are those of the angular 
power spectrum Ct. 



(6) 



where £ is the index corresponding to as before. 
We know the covariance matrices of a and n, and we 
assume that the signal and noise are uncorrelated: 
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{a^Tii) = 0. We can therefore write down the 
covariance matrix of the data vector d: 



M = {d(F) = Y (WCW) Y'' 



N. 



The matrix WCW is the beam-smoothed angular 
power spectrum, and recaU W'^ = W. 



2.2. The Karhunen-Loeve Transform 



The full likelihood function is awkward to 
(7) compute, since it involves inverting an iVpix x A'pix 
matrix. This has been done for a small family of 



models using the two-year DMR data (Tegmark fc 



Since we are assuming that both the signal and 
noise are Gaussian, we know the entire probability 
distribution function of the data vector d: 



f{d\C) = 



1 



(27r)^pix/2 det^/2 M 



exp 



(8) 



We will regard / as a function of C rather than 
as a function of the data d: after all, d is fixed and C 
is unknown. In this context, / is called the likelihood 
and is denoted L{C). The maximum-likelihood 
estimate of some set of parameters q (which may 
of course be a set containing only one element), is 
that for which L(C{q)) = L{q) is maximized. The 
Bayesian credible region, assuming a uniform prior 
for the parameters, is a volume V bounded by a 
surface of constant L such that 



Bunn 1995 ), but if we want to look at a larger family 
of models, we need to find a way to "compress" 
the data before computing the likelihood. This is 
the purpose of the KL transform. The idea is to 
(linearly) project the data vector onto a subspace of 
smaller dimension, and compute likelihoods based on 
the projected data vector. We attempt to choose the 
subspace in such a way that the likelihood function 
based on the new "compressed" data vector contains 
almost all of the useful information in the original 
likelihood function. 

To be specific, suppose that our goal is to 
estimate the normalization of the power spectrum. 
Then the optimal _D-dimcnsional subspace to choose 
turns out to be that spanned by the D solutions 
da with the largest eigenvalues of the following 
eigenvalue equation: 



L{q) dq 



L{q) dq, 



(9) 



(10) 



where M^ig = YWCWY' 



is the signal part 



where c 


is the confidence level associated with the 


of the covariance matrix (Bond 1995, Bunn & 


region. 


and the integral on the right-hand side 


Sugiyama 1995, Bunn, Scott & White 1995, White 


extends 


over all of q space. Note that this procedure 


& Bunn 1994 Bunn 1995, Vogeley & Szalay 1996). 



is completely general, and in no way depends upon 
the assumption of Gaussian statistics. Of course, if 
we do not make the assumption of Gaussianity, then 
Eq. (^) is not valid, and we need to replace it with 
something else. 

These prescriptions tell us how to perform our 
parameter estimates. Unfortunately, we have no 
guarantee that these estimates will be unbiased. In 
the case of Gaussian statistics, parameter estimates 
are asymptotically unbiased; however, since we deal 
with finite sample sizes, this guarantee does not 
apply. If we are concerned about bias, we therefore 
have no recourse but to perform Monte Carlo 
simulations to test for it, as we discuss in SE^ 



We should warn the reader that there are two 
definitions of the eigenvalues in popular use. For one 
choice the eigenvalues are in units of signal-to-noise 
and range from to infinity. The other, used in this 
paper, has the eigenvalues in the range to 1. The 
two are related by the mapping = ^a/i^a — 1) 
with Aa as above. 

Of course, we don't know M a priori, since we 
don't know the power spectrum. (If we did, there 
would be no need to do a likelihood analysis!). We 
must therefore choose a "fiducial" power spectrum 
Ci in order to compute the eigenmodes. We use a 
fiat power spectrum £{£ + l)Ce =constant as our 
fiducial power spectrum for the entire likelihood 
analysis, although as we will describe below we have 
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Fig. 1. — The points show the eigenvalues associated 
with the KL expansion of the four- year COBE data, 
sorted in decreasing order. The solid curve is a 
running sum of the squares of these eigenvalues, 
normalized to a final value of 1. 
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Fig. 3. — Histograms of the mean-square power as a 
function of £ are shown for various eigenmodes. The 
modes plotted are numbers a = 1, 50, 100, 200, and 
400, from left to right. 



performed tests to show that our results do not 
depend significantly on this choice. 

The signal-to-noise eigenvalues are shown in 
Fig. |l|, where it can be seen that the first ~ 400 
contain significant signal. The accuracy with which 
we can measure power spectrum normalizations 
is determined by the sharpness of the peak of 
the likelihood function, that is, by the quantity 
— {L"{q)), where q represents the normalization and 
the derivatives are evaluated at the peak of the 
likelihood function. This quantity is proportional to 
X^a -^a; which is therefore the relevant figure of merit 
to describe our ability to reject incorrect models. 
This quantity is also plotted in Fig. |l|. 

Contour plots of a sample of eigenvectors in 
Aitoff projection are shown in Fig. The multipoles 
probed by these eigenvectors are shown in Fig. ||. 
Many of the eigenmodes show a surprising lack of 
symmetry between the upper and lower hemispheres; 
mode 400, for example, shows much more structure 
in the upper hemisphere than in the lower. In every 
such case, there is a nearly degenerate eigenmode 
that is a approximately a reflection of the given 
mode. For example, mode 399 looks very much 
like mode 400, but with most of its structure 



concentrated in the lower hemisphere. 

Once we have chosen our subspace, we can 
project the data vector down onto this subspace and 
compute likelihoods in terms of the projected vector. 
However, before we can use these likelihoods we need 
to take proper account of the monopole and dipole. 
When we remove a best-fit monopole and dipole 
from the data, we are inadvertently removing some 
contribution from the other modes, since incomplete 
sky coverage breaks the orthogonality of the 
spherical harmonics. From a Bayesian point of view, 
the natural way to correct for this is to marginalize 
over the monopole and dipole. That is, we should 
replace Lhy J L daQodai^idaigdan, where aoo is the 
monopole and ai,„ are the three dipole components. 
All four of these quantities are unknown, so we 
marginalize over them by letting them range over 
all possible values and integrating. Fortunately, 
the variation of L with these four quantities has a 
simple Gaussian form, and so the integral can be 
done analytically. This procedure turns out to be 
mathematically equivalent to forcing the eigenmodes 
to be orthogonal to the monopole and dipole, e.g., 
by Gram-Schmidt orthogonalization, as is done by 
Gorski et al. (1996a). 
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Fig. 2. — Aitoff-projected contour plots are shown o 
right, top-to-bottom. 

In previous analyses of the DMR data, the 
quadrupole has frequently been excluded along with 
the monopole and dipole. Although we will give 
a few quadrupolc-cxcludcd results below, most of 
our results will include the information contained 
in the quadrupole. Ever since the initial detection 
of anisotropy in the one-year COBE maps, it has 
been known that the COBE quadrupole moment 
is lower than one would expect from "preferred" 
cosmological models {i.e., those with roughly 
scale-invariant power spectra) normalized to the 
modes with £ > 3. Since we know a priori that 
the COBE quadrupole is anomalous (compared to 
our theoretical expectations), we should be quite 
hesitant to throw it away: in throwing away data 
that is known a priori to be discordant, we run 
the risk of biased data editing. In the absence of 




eigenmodes 1, 10, 50, 100, 200, running from left-to- 



compelling evidence of quadrupolar contamination, 
we therefore regard it as more prudent to retain the 
quadrupole. 

There are many ways to estimate the quadrupole 
from the DMR data, and in general they are not 
equivalent. (In the absence of complete sky coverage, 
there is no way to estimate a particular multipole 
without contamination from other multipoles.) If we 
estimate the five components of the quadrupole by 
simply integrating over the observed part of the sky, 

47r 

hm = -TJ X] diY2m{fi), (11) 

and estimate the quadrupole by computing 

= (l/47r)^^ |?'2mP and subtracting off noise 
bias, we find that Q = 5.9 /uK for the four-year 
DMR data. Using the same estimator, we find 
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quadrupoles of 8.9 and 6.6 for the one-year 
and two-year data sets. To decide whether this 
quantity is inconsistent with a particular model, 
we need to compare it with simulations. We find 
that Q is low enough to be inconsistent with a flat 
Harrison-Zel'dovich power spectrum at about the 
98% confidence level. One should be reluctant to 
rule out any models on this basis, however: we knew 
a priori that the quadrupole was low, and confidence 
levels based on a subset of data that is known a 
priori to be discordant can be misleading. 

2.3. Tests of our Method 

To define our subspace we truncate the 
eigenmode expansion at 500 vectors, as we find 
that increasing the number of modes makes 
little difference to the results. For example, 
the estimate of the quadrupole normalization 
Q = a/5CV4^ for a flat Sachs- Wolfe n = 1 
spectrum is Q = 18.73 ± 1.25 fiK using 500 modesQ. 
If we increase the number of modes to 700, we 
find Q = 18.67 ± 1.26 /l(K. Increasing the number 
of modes also fails to increase significantly our 
sensitivity to the shape of the power spectrum. For 
example, using 500 modes our determination of the 
spectral index n (for a pure Sachs- Wolfe spectrum) 
is n = 1.184 ± 0.282; using 700 modes this result 
becomes n = 1.177 ± 0.279. 

Removing the quadrupole information from 
the fit (by marginalizing over the quadrupole as 
well as the monopole and dipole) increased the 
normalization to Q = 19.6 for n = 1, and changes 
the best-fit value of n from 1.2 to 1.0. 

In order to assess the sensitivity of our results to 
the choice of fiducial power spectrum, we computed 
some likelihoods using an n = 1.5 Sachs- Wolfe 
power spectrum (as opposed to the flat n — \ 
spectrum) as our fiducial power spectrum. The 
results are virtually identical. For example, the 
maximum-likelihood estimate of Q for a flat n = 1 



^ With 500 vectors the typical time to evaluate the likelihood 
function at one point is 40 seconds on a Sparc-10. 



power spectrum, computed with an n = 1.5 fiducial 
power spectrum, is 18.74 /iK. Furthermore, the 
maximum-likelihood point in the (n, Q) plane is 
(1.2, 16.22 /iK) for either a flat or an n = 1.5 flducial 
power spectrum. 

As mentioned before, we have no a priori 
guarantee that our maximum-likelihood parameter 
estimates will be unbiased. We have therefore 
performed Monte Carlo simulations to test for 
bias. We generated 564 simulated sky maps with 
a pure n = \ Sachs- Wolfe power spectrum and a 
normalization of Q = 19 /iK, and we determined the 
maximum-likelihood normalization for each. The 
average estimated normalization was Q = 18.98 /iK, 
with a standard deviation of ct = 1.22 /iK. Note that 
this standard deviation is a frequentist estimate of 
the error in our estimate of the normalization, which 
can be compared with the Bayesian error estimate of 
1.25 /iK given above. 

In the above test, the input power spectrum 
was the same as the fiducial power spectrum. Since 
this might not be a fair test, we also performed 
simulations with an n = 1.5 Sachs- Wolfe power 
spectrum and a scale-invariant "standard CDM" 
spectrum. For the n = 1.5 spectrum, the input 
normalization was Q — H /iK, and the average 
estimated normalization was 12.99 /iK with a 
standard deviation of 0.83 /iK. In the simulations 
of a standard CDM spectrum, we used an input 
normalization of Q = 18 /iK; in this case, the 
mean estimated normalization was 18.03 /iK with 
a standard deviation of 1.16/iK. Based on these 
results, we are confident that there is no significant 
bias in our normalization estimates. 

We also tested for bias in our estimate 
of the slope n of the power spectrum. We 
made simulated maps with Sachs- Wolfe input 
spectra with parameters {n,Q) ~ (1,19/iK) and 
{n,Q) = (1.5,13/iK) as before, and found the 
maximum-likelihood point in the (n, Q) plane for 
each. We found that the average estimate of n in 
the two cases was 1.02 and 1.49, with standard 
deviations of 0.26 and 0.27. 

Our results for pure Sachs- Wolfe spectra agree 
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Fig. 4. — The maximum-likelihood power spectrum 
obtained by letting all Cis with 2 < I < 19 vary 
freely is shown, together with a, Q — 19 iiK flat 
Sachs- Wolfe power spectrum. The error bars are 
the standard errors determined by approximating the 
likelihood function as a Gaussian near the peak; there 
are significant correlations between the errors. 



well with those of (Gorski et al. 1996), though an 
exact comparison is not possible since we use a 
combination of the maps which excludes the 31 GHz 
map and weights the 53GHz and 90GHz maps 
differently. Furthermore, the KL transform and 
the orthogonalized spherical harmonic technique of 
Gorski et al. give give slightly different weights to 
the various spherical harmonic modes. We therefore 
would not expect perfect quantitative agreement 
between our results even if we used identical maps. 
We would of course expect any discrepancies to be 
well within the errors, though, and in fact we find 
that this is the case. 

Gorski et al. (1996) quote a maximum likelihood 
point of (n, Q) = (1.22, 15.9/iK) and a best fitting 
scale-invariant normalization of Q = 18.7/xK if 
they include the quadrupole and use the "coadded" 
ecliptic maps with the custom cut (the closest to 
our procedure). Our maximum likelihood point 
is (n, Q) = (1.2, 16.2/iK) and our best fitting 
scale- invariant normalization Q = 18.7/iK. The 



difference in likelihood between (1.2, 16.2/iK) and 
(1.22, 15.9/iK) in our calculation is only 3%, so 
this agreement is very good. For the quadrupole 
excluded results, ( Gorski et al. 199^ ) find the most 
likely point is {n,Q) = (1.00, 19.1/xK) while we find 
(rijQ) — (1.00, 19.6/iK). Again the best fitting point 
of ( Gorski et al. 199"6| ) is only slightly less likely than 
our maximum likelihood point and well within our 
la error ellipse. 

The best-fitting Cg spectrum is shown in 
Fig. ^. To determine this spectrum, we let all 
Cf's for 2 < £ < 19 vary independently until the 
likelihood was maximized. A fiat power spectrum 
with Q = 19 fiK is shown for comparison. The 
error bars shown in this figure are standard errors 
determined by approximating the likelihood near 
the peak as a Gaussian. The standard errors are 
then the square roots of the diagonal elements 
of the covariance matrix of this Gaussian. Error 
bars determined in this way should be viewed with 
extreme caution. First, the likelihood is not very 
well approximated by a Gaussian: on the contrary, 
it is strongly skew-positive at low £. Second, 
these standard errors contain no information about 
correlations between the errors. These correlations 
are largest for pairs of modes whose ^-values differ 
by 2. Coupling between modes with Ai = 1 are 
weak because the data has approximate refiection 
symmetry. The normalized correlation coefficient 
Cov{Ci, QO/ VVar(C£)Var(CfO is typically - 15% 
at low £ for A£ = 2 and decreases to ~ 5% for larger 
£. The correlations are typically ~ 5% for A£ = 4 
over the whole range in £. The deceptively small 
error bar on the estimate of C2 is largely due to 
the failure of the Gaussian approximation for the 
likelihood, although the 15% anticorrelation between 
C2 and C4 also plays a role. 



2.4. Filtering the Data 

The KL expansion is essentially a technique for 
optimally retaining signal and throwing away noise 
in a data set. As one might expect, it is closely 
related to similar methods, such as the Wiener 
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filter (Bunn, HofTman, & Silk 1996). Assuming that 
the statistical properties of the signal and noise 
are known, applying the Wiener filter to a map 
preferentially removes noise and leaves signal. It is 
the optimal linear filter which can be constructed for 
this purpose. 

Wiener filtering is particularly easy to perform 
in the KL basis. Specifically, the amplitude of the 
ath eigenmode of the Wiener-filtered sky map is 
obtained simply by multiplying the amplitude of the 
same mode of the raw map by the eigenvalue Xa- 
In Fig. H we show the result of Wiener-filtering the 
DMR data in this way, as well as a map obtained 
by simply truncating the KL expansion after 400 
modes. 



3. Prequentist limits 

The limits and error bars we have quoted so 
far in this paper have been based upon Bayesian 
rather than frequentist techniques. This has become 
standard practice in CMB data analysis, largely 
because Bayesian results are often easier to compute 
than frequentist ones. For a Bayesian, the likelihood 
function is all that is needed, whereas to find the 
boundaries of a frequentist confidence interval 
one also needs to know the theoretical probability 
distribution of some goodness-of-fit statistic. These 
probability distributions can often only be computed 
by time-consuming Monte Carlo simulations. 

Although there is nothing intrinsically 
wrong with Bayesian methods, there are certain 
circumstances in which a frequentist approach is 
to be preferred. First, Bayesian estimates may 
sometimes depend too strongly on the assumed prior, 
making interpretation of the results controversial. 
As a rule, this is a serious problem only when the 
parameter being estimated is weakly constrained by 



the data (e.g. Bunn et al. 1994). Second, although 
Bayesian techniques supply relative probabilities 
of different models, they are in general unable to 
determine the intrinsic goodness of fit of a single 
model. As we saw above, we are able to find 
the most probable values of parameters like the 



normalization Q and the spectral index n; however, 
Bayesian methods are unable to tell us whether or 
not the best fit is actually a good fit. 

It is therefore of interest to consider ways of 
applying frequentist techniques to the COBE data. 
(We have in fact mentioned some frequentist error 
estimates above, when we discussed the Monte 
Carlo simulations we performed to test for bias 
in our maximum-likelihood parameter estimates.) 
We begin by reminding the reader of the general 
approach a frequentist takes to hypothesis testing. 

Frequentist statistical analyses are always based 
on some goodness-of fit statistic rj. The statistic 
should be chosen in such a way that it takes on 
low values when the data fit the model well and 
large values when the fit is poor. For a particular 
hypothesis, we compute the probability distribution 
of 77, and we say that the hypothesis is ruled out with 
significance s if P{r] > fy) < s, where fj is the value 
found in the actual data. For such an analysis to 
have high power, it is obviously essential to choose rj 
wisely. 

One common choice of goodness-of-fit statistic 
is the likelihood ratio. To illustrate this choice, let us 
consider a family of spatially flat, CDM models with 
a cosmological constant (ACDM) and n = 1. These 
models are characterized on COBE scales by only 
two parameters, the normalization Q and the density 
VIq = 1 — JIa. If we wish to test the hypothesis that 
ilo takes on some specific value fig, we choose as a 
goodness-of-fit statistic 



77 



maxQ^Hp L(Q,r?o) 
maxQ L{Q, fl^) 



(12) 



If fig is far from ^Iq, then we expect this ratio to 
be large. In order to decide whether a particular 
value of rip is allowed by the data, we compute 
the probability distribution of rj by making many 
simulated COBE sky maps with a power spectrum 
corresponding to flo — f^o ^'^^ computing the 
likelihood ratio ([l^ ) for each one. This time- 
consuming process must be repeated for each value 
of ilg we are interested in. 

Upon performing this procedure for Qq = 0.1, 
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we find that the likelihood ratio is 3.16. Only 10% of 
simulated maps give a larger value, and we therefore 
conclude that this particular model is ruled out with 
a significance of 10% (or at 90% confidence). By way 
of comparison, we shall see below that a Bayesian 
analysis of the same family of models says that a 
95% credible lower limit on Hq is 0.13. 

Note that this family of models is an example 
of a situation where the Bayesian limit will have 
significant prior dependence, since as Fig. ^ shows, 
the likelihood is large over a significant fraction of 
the allowed parameter space. A frequentist limit 
is therefore quite useful. Unfortunately, frequentist 
limits based on likelihood ratios are extremely 
expensive to compute, since the probability 
distribution of r) must be recomputed by Monte 
Carlo simulation for each point in parameter space. 

The use of likelihood ratios to set frequentist 
limits alleviates only one of the two disadvantages 
of the Bayesian approach: we have "removed" the 
prior dependence, but the likelihood ratio still gives 
no information about intrinsic goodness of fit: even 
if the data are an intrinsically poor fit to the whole 
family of models, there will still be a value of fig 
for which the likelihood will peak and 77 will take on 



its lowest possible value. In (White & Bunn 1995) 



we proposed a different goodness-of-fit statistic to 
resolve this problem, and we will now describe a 
slight variant of this statistic. 

Let X be the vector of amplitudes in our 
eigenmode expansion. For a particular theoretical 
model, we can compute the theoretical covariance 
matrix A4 = {xaP"), and then transform to a new 
basis: 

y = M-'^/'^x. (13) 

Any convenient square root of the covariance 
matrix may be chosen; we recommend Cholesky 
decomposition. The advantage of changing basis in 
this way is that the predicted covariance matrix of y 
is the identity matrix. That is, each yi is predicted 
to be an independent unit Gaussian random variable. 
We shall see that this can save us a great deal of 
effort. 

Next, we sort the {yi} from largest to smallest 



angular scale probed (by computing the mean value 
of € in a spherical harmonic expansion) and sum yf 
over bins of some size K: 



aK 



(14) 



= (a-l)i<'+l 



where 1 < a < NjK and N is the total number of 
modes. If our theoretical model is correct, then each 
Za should be X/f"distributed and should therefore 
have expectation value K . If our model is incorrect, 
then the dispersion of Za about this expectation 
value should be larger. We therefore propose as a 
goodness-of-fit statistic the variance. 



NjK 



2N ^ 



Kf 



(15) 



The prefactor 1/2N is arbitrary; we chose it so that 
the expectation value of rj is one when the model is 
correct. 

One great advantage of this statistic is that 
its probability distribution is model-independent: 
T] is simply the average of N/K independent 
X^-distributed random variables. We need to 
compute this probability distribution only once. 
Furthermore, unlike the likelihood ratio, this statistic 
is in principle capable of testing the possibility that 
the whole family of models under consideration are 
poor fits to the data. 



The statistic (^^ has two parameters: the 
number of modes N and the bin size K. We 
experimented extensively with simulated data sets 
to find the values of those statistics that maximized 
the power to reject incorrect models]^ We made 
simulated sky maps with a flat fio = 1 power 
spectrum and tried to find values of the parameters 
that maximized our ability to reject an fig = 0.1 
model. We found that the greatest power was 
achieved when the number of bins was fairly small; 
we chose N = 200 and K = 40, so that the number 



^ It is of course crucial to perform such experiments on 
simulated data sets before looking at the real data; choosing a 
statistic based on its behavior when applied to the real data is 
a statistical faux pas. 
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of bins is only five. Unfortunately, the power of this 
statistic proves to be quite low: we are typically able 
to reject the low-57o niodel with significances of only 
~ 20%. As we have seen, the likelihood ratio allows 
significantly greater power, rejecting the same model 
at a significance of 10%. 

It is in general difRcult to tell a priori whether a 
particular goodness-of-fit statistic will be a powerful 
discriminator among a given class of models; 
the statistic 77 does not perform as well in this 
regard as one might have hoped. Although this 
statistic has low power, it is perhaps nonetheless 
worth quoting some results based on it, since it 
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is capable of supplying intrinsic goodness-of-fit Fig. 6 

values for individual models (a task for which the CDM 

likelihood-ratio is ill-suited). We find that, for the tensor 

Harrison-Zel'dovich ACDM models we have been arbitrarily 

considering in this section, the significance ranges differ s: 

from 29% when fio = 1 to 17% when Hq = 0.1, with curve, whi 



higher significances denoting better fits. The fact 
that all of these significance levels are acceptable 
is somewhat reassuring: in principle we could have 
discovered that the whole class of models fit the 
data at an unacceptably poor level, in which case 
estimating parameters by means of likelihood ratios 
or Bayesian methods would be a suspect procedure. 



4. Normalization of the Anisotropy 
Spectrum 

We now turn to the problem of quoting the 
best-fitting amplitude for various models. We break 
this into two parts: determining the best-fitting 
amplitude of the radiation power spectrum and the 
best-fitting amplitude of the matter power spectrum. 
We discuss the first problem in this section. Since the 
ratio of the radiation to the matter power spectrum 
is an output of the model under consideration, the 
second follows from the first and will be considered 



The inadequacy of using only the RMS 
fiuctuation in the map to determine the amplitude 
pf a given model from hao boon dioouoocd by (Bunn 



The angular power spectra i{i + l)Ce for 
with ^Bh^ = 0.01, 0.02, 0.03. The 
spectrum is also shown. All models are 
normalized at ^ = 10. Note that the curves 
ignificantly, even at low £, from the flat dashed 
:h represents a pure Sachs- Wolfe spectrum. 



& Bunn 1995). As an example, normalizing a 
scale-invariant Sachs- Wolfe spectrum (Eq. |l^) to the 
RMS anisotropy smoothed to 10°, trio = 29 ± 1/iK, 
one would obtain 10"Cio ^ 0.43 ± 0.03. This 
is 30% 2a) lower than obtained from a fit to 
the full data set. However, while the data clearly 
cannot be summarized by one number, keeping the 
information from each and every pixel is redundant 
for the theories we are considering. Since fitting 
theories directly to the data is quite time consuming, 
in what follows we will quote results which allow a 
simple but accurate method for normalizing a given 
radiation spectrum to the full COBE data. 

For most models of structure formation the 
angular power spectrum at low £ contains little 
structure. If we could simply parameterize the 
shapes of the spectra of interest, we could pre- 
compute and tabulate the normalizations from the 
COBE data. Which types of spectra should we 
investigate? As can be seen in Figs. 6 and |^, the 
radiation power spectrum for a cold dark matter 
model is not that of a pure Sachs- Wolfe spectrum 



Jiiott White IQQS r Baiida^' ut al. H^rWfart 



of gravitational potential perturbations ( Abbott 



& Wise 1984, Bond & Efstathiou 1987, White et 
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Fig. 7. — The shapes of the angular power spectra are 
plotted for various CDM models. The upper panel 
shows spatially flat models with fio + f^A = 1, and the 
lower panel shows models with J7a — 0. All models 
have n — \ and are labeled with the appropriate value 

of r^o- 



al. 1994, [Peebles 199^ ): 

r(3- 



Ci oc 



2 I 



r2(4^)r(^- 



(16) 



For standard CDM the effect of including the 
full Ci spectrum (that is adding the integrated 
Sachs- Wolfe and velocity contributions) is to reduce 
the best- fitting normalization by ~ 10% at C2, 
which gives an indication of the curvature of the 
spectrum. Including a full numerical calculation of 
the radiation spectrum also introduces dependences 
on the cosmological parameters. All except for fig 
and n are small, as discussed in 



We follow ( [White fc Bunn 1995| ) in 
parameterizing the radiation power spectrum 
in terms of quadratics. Specifically, if we write 

D(x) = i{l + 1)C£ with X = logio ^ ■. 



(17) 



then we provide the normalization for quadratic 
D{x) using the methods outlined in §H 

To normalize a theory then, one computes the 
radiation power spectrum for that theory, finds the 
closest quadratic approximation to it over the range 
relevant to COBE (roughly £ — 2 to 30) and reads 



off the normalization which we provide for that 
quadratic. This works quite well for a large range of 
theories. For example, for open and ACDM models 
with 0.2 < < 1 and 0.8 < n < 1.1 we find at most 
a 1% difference between normalizing to a quadratic 
approximation!^ and normalizing to the full theory, 
as shown in Table [|. 

We choose to parameterize the shape of the 
spectrum by the (normalized) first and second 
derivatives at x = 1, specifically D' and D" where 



D{x) ~ Di 



(18) 



Note that D' and D" are 1/Z?i times the first and 
second derivatives of D{x) at a; = 1. The overall 
normalization is Di = llOCio- Below we quote the 
normalization as Di or Cio, for each {D',D") pair, 
and the goodness of fit by the relative likelihood 
of that shape compared to a featureless, n = 1, 
Sachs- Wolfe spectrum: {D',D") = (0,0). 

In terms of this parameterization we find that 
the following fitting function 

InL = -0.01669+ 1.198951?' (19) 
-0.83527L''2 - 0.43541L>" 
-0.03421i:>'7:>" + 0.01049i:»"2 

describes the likelihood function with an error 
of 6% in L over the range —0.6 < D' < 0.6 
and — 1 < D" < 3. The peak likelihood is at 
{D',D") = (—0.1,-4), which is outside the range 
of the fitting function, and the likelihood at peak is 
8.9 times that for a flat spectrum. The likelihood is 
plotted as a function of D' and D" in Fig. ^. 

Similarly, the best-fitting value of Cio is well 
approximated by 

10"Cio = 0.64575 + 0.022821?' (20) 
+0.01391L»'2 - 0.018191?" 
-0.00646I?'L'" + 0.00103i:'"2 



^Specifically we use the best-fit quadratic to the C( between 
= 3 and 20 with points weighted equally in log^. 
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Likelihood 



Fig. 8.— The hkehhood L{D',D") is plotted for 
power spectra given by Eq. (18). Likelihoods are 
normalized so that a flat power spectrum (D' — D" = 
0) has L = 1, and the contours range from 1 to 8. 

with an error of 1% over the same range. The la 
statistical uncertainty in Cio is 13.8%. Uncertainties 
in rms quantities are therefore 7%.^ 

Several popular theories (e.g. open and ACDM 
models) have radiation power spectra which are 
not well fit by quadratics all the way down to the 
quadrupole (e.g. Figs. 6, 0). This is especially true 
if a tensor component is added since the £ — 2 
mode is high in the tensor spectrum. While the 
best-fitting normalization is not too sensitive to the 
assumption of a quadratic over the entire range, this 
is not true of the likelihood. We quantify in Fig. ^ 
and in Table ^ what error in the likelihood function 
and the normalization arise from approximating 
several spectra by quadratics. The quadratic 



* These uncertainties are purely statistical. There appears 
to be an additional few-percent systematic uncertainty, based 
on the fact that different pixeliz ations of the maps lead to 
slightly different normalizations ( [Sorski ct al. 199t ). One 



possible estimate of the uncertainty might therefore be the 
quadrature sum of 7% and, say, 3%, or 7.6%. However, 
since systematic errors are generally highly non-Gaussian, a 
simple quadrature sum may underestimate the uncertainty. A 
conservative upper bound on the uncertainty would be the sum 
of statistical and systematic errors, o r 10%. For a more d etailed 
discussion o f systematic errors, see CJorski et al. 1996, Gorski 
2t al. 1996b, Kogut et al. 1996t and references therein. 



0.5 



- 



0.02 



- — 



-0.02 - 



-0.04 



Fig. 9. — The error in the normalization and the 
likelihood that arises if a theory which is quadratic 
in £{£ + l)Ci for £ > £o but constant for £ < £o 
is approximated by the pure quadratic using the 
fitting functions Eqs. (19) and (20). The underlying 
quadratics are taken to be {D',D") =(-0.2,-1; solid), 
(-0.2,1; dotted), (0.2, -1; short-dashed), (0.2,1; long- 
dashed) . 

approximation causes errors of consistently less 
than 1% in the normalization Cio; these errors are 
completely negligible in comparison with statistical 
and systematic uncertainties. The errors in the 
likelihood are somewhat larger, but still not enough 
to significantly affect our conclusions. 

The constraints on models from the shape of 
the COBE power spectrum are not strong. Fig. |l^ 
shows the likelihood function for a family of CDM 
models in which the density and the cosmological 
constant are allowed to vary while the spectral 
index is held fixed. Fig. |ll| shows the likelihood 
function for models with f2A = and models with 
r^o + ^^A = 1 for three values of the spectral index. 

The likelihoods in these figures were computed 
using the quadratic approximation ( [l^ ) to the power 
spectrum rather than the exact power spectrum. 
The fit to a quadratic is good to ^ 3% for ^ > 3, but 
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Table 1: The percentage error introduced by parameterizing theories as quadratics. We have compared the 
normalization and likelihood for 50 ACDM and OCDM theories with 0.8 < n < 1.1 and 0.2 < J7o < 1 as 
computed using Eqs. (19,20) and directly computed from the data using the numerical power spectrum. 
Those theories where using the quadratic approximation induces an error of > 0.5% in Cio or > 10% in 
likelihood are shown above. The column labeled GW indicates whether a component of gravity waves was 
included, as discussed in the text. 
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Fig. 10. — The hkehhood function is shown for a 
family of CDM models with n = 1. The likelihoods 
are normalized so that a flat power spectrum has 
i = 1, and the contours are separated by 0.1 in 
L. The straight dashed line shows the spatially flat 
models. 
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Fig. 11. — Likelihood as a function of fJo for CDM 
models with zero cosmological constant (left) and 
zero spatial curvature (right). The spectral index n 
increases from 0.8 to 1.2 from top to bottom. The 
likelihoods are normalized so that a flat spectrum has 
L = 1. 



in the worst cases the error in C2 can be as much 
as 20%. This causes errors of approximately 10-20% 
in the likelihoods in the worst cases but does not 
change the general features of the plots. 

It is clear from these figures that low values of 
ilo ai'c disfavored, especially in the open models. To 
be more specific, the 95% credible Bayesian lower 
limit on Qq is 0.13 in a scale-invariant {n = 1) 
ACDM model, and the 95% credible region for 
scale-invariant open models is 0.3 < fio < 1-6. Both 
of these limits were computed using a uniform prior 
in r^o- In the case of the cosmological constant 
models, the prior is of course nonzero only when 

o<no<i. 

The low likelihood for low-fip open models 
comes largely from the shape of the power spectrum 
at low £. In fact, a significant portion of our rejection 
power for these models comes from the quadrupole 
information. If we exclude the quadrupole from 



consideration, we find, in agreement with (Gorski et 



al. 1996b), that we have no 95%-confidcncc lower 
limit on Qq in a scale-invariant open model. In the 
case of A models, excluding the quadrupole weakens 
the lower limit on Hq to about 0.08. 



5. Normalization of the Matter Spectrum 

5.1. Notation 

Until now we have talked about the 
normalization of the radiation power spectrum 
on large angular scales. Any theory of structure 
formation predicts a definite ratio for the 
normalization of the radiation and matter power 
spectra. For this reason the COBE DMR detection 
of CMB anisotropics ( ^moot et al. 1992 ) allows us 
to directly normalize the potential fluctuations at 
near-horizon scales. 

For the matter density perturbations, the large- 
scale structure (LSS) data are usually expressed in 
terms of the power spectrum P(k) = \Sk\^, where 
6k is the Fourier transform of the fractional density 
perturbation 



4^^(|k|) 



rf3^ ^(x)e'k.x 



(21) 



Since standard models postulate Gaussian 
fluctuations, specifying the power spectrum 
completely determines the properties of the 
fluctuations. As the model has no preferred 
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direction, the spectrum depends only on the 
magnitude of k. Another measure of P{k) that is 
often used is the contribution to the mass variance 
per unit interval in Ink, denoted A^(fc), which has 
the virtue of being dimensionless: 



to be a power law with index n. The relation of the 
large-scale matter fluctuation amplitude, (5h, to the 
large-angle radiation anisotropy amplitude, VCio, 
is shown in Fig. n2 and will be discussed more in 



5.3-5.4, In the matter-dominated, critical density, 



da. 



2 

mass 



din A: 



2^ 



(22) 



The normalization of P(fc) is frequently quoted in 
terms of 



4 



k \ kr 



(23) 



with r = 8 /i^^Mpc, which measures the variance 
of fluctuations in spheres of radius 8 ft,~-'^Mpc. 
Using the Press-Schechter or peak-patch methods, 
its value can be inferred from the abundance of 



clusters (Bond fc Myers 1991, White, Efstathiou 



fc Frenk 199S , Carlberg et al. 1994^ [Viana fc 
Liddle 1996| , [Bond fc Myers 1996| ) to be erg ~ 0.5-0.8, 



Sachs- Wolfe approximation the proportionality 
constant in Eq. (|l6|) is simply (7r^/8)(5^. 

We find to very good approximation that Sn 
as determined by COBE is independent of both h 
and r^B (see Figs. 13, |l^; for the open models the 
sensitivity to h is similar to the fl^ = 1 case in 
Fig. 13 while the sensitivity to flBh"^ is less than in 
Fig. |lj), although it will depend on Hq and Ha- For 
definiteness we use flBh^ = 0.0125 and h — 0.75. For 
high JIb/i^ and low h one can scale (5h using Figs. 13, 

E 

The value of Sn is related to the common 
inflationary amplitude of the scalar perturbations 



/ith some flij dependence. — Specifically (Viana fc 



As (lurner fc White 1996, Bunn, Liddle fc 



Liddle 1996D -&Tid 



fjg - (0.6±0.1)f7o' 



(24) 



with a ~ 0.4 for open CDM and a ~ 0.45 for 
ACDM. (More accurate fits plus a discussion of the 
uncertainty as a function of flo can be found in 
their paper.) These values are consistent with those 



White 1996| ) by As = flo/gi^o) x Sh, where g{n) is 
the growth factor discussed in §^|^. When we need 
to quote the normalization of the tensor (or gravity 
wave) spectrum independently of the scalars, we 
shall normalize them to At, where at large scales 



(flurner fc White 1996| ) 
dflcw 75 



inferred from large-scale fiows ( Dekel 1994 , Strauss 
fc Willick 1995 ) and direct observations of galaxies 
(e.g. Loveday et al. 199^ ). Note that for very low fio, 
this implies that optical galaxies become anti-biased 



din fc 



32 



fc 

i5o 



-2+nT 



(26) 



(i.e. b : 



/fT8 < 1). 



Given Sn, the value of erg can be calculated 
using Eq. (^). This will introduce an additional 
dependence on n, Qq, h and the dark matter content 
of the universe, e.g. r^HDM- We discuss this in j|5.5|. 



While as has been the traditional means of 
quoting the normalization, COBE probes scales near 
the horizon size today, so we find it cleanest to quote 
the normalization in terms of the amplitude of the 
mass or potential fluctuations at large scales (small 
fc). Specifically we use Jh, the density perturbation 
at horizon-crossing, which is defined through (see 
e.g. [Liddle fc Lyth 1993| ) 



A^(fc) 



fc3p(fc) 

27r2 



5.2. Critical Density Models 

In the simplest picture, in which large-angle 
CMB anisotropies come purely from potential 
fluctuations on the last scattering surface, 
the relative normalization of the CMB and 
matter power spectrum today is straightforward 



T\k), (25) 



with T(fc) the transfer function describing the 
processing of the initial fluctuations, which we take 



(e.g. [Efstathiou 1990i [White et al. 19941 ). In a 
critical-density universe the potentials are constant 
in time. If we also assume adiabatic fluctuations, the 
temperature anisotropy is simply one third of the 
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Fig. 12. — Relation between the normalization of the 
radiation power spectrum (e.g. Cio) and the matter 
power spectrum (e.g. fe) as a function of fio for CDM 
models with scale-invariant scalar perturbations. 
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Hu 1996 for a pedagogical derivation). 



For scale invariant scalar fluctuations in a CDM 
dominated universe with h ~ 0.5 and Qb = 0.05 one 
obtains (5h — 1-92 x 10~^. Over the range of scales 
probed by COBE most models predict spectra which 



Stewart 1996) and that for which 

1-n 



(27) 



aie well appiuAimaLea uy puwui-iaws, su wu wi 
allow ourselves to depart from strictly scale-invariant 
models but keep a power-law initial fluctuation 



which corresponds to power- law inflation ( Lucchin 
fc Matarrese 1984 |Lyth fc Stewart 1992|). In 



^pucLruin. The resulLs fur Lilted {n ^ 1) models aiv 



the flo — 1 and n 
reduces to the more familiar T/S 



'd, ypyi^mi i^uyti uf Eq. 



1 limit the above equation 

7(1 -n) ([Liddlc 



fc Lyth 199^, [Davis et al. 1992| , [Crittenden ct 
al. 1993[ , [Stewart fc Lyth 1993[ ). More details on the 



In addition to scalar density fluctuations 
one must consider the possible contribution of 
gravity waves (tensors) to the COBE fluctuations. 
If this contribution is non-negligible then the 
inferred matter fluctuations are correspondingly 
lower. Conventionally, this is defined in terms of 
the ratio of tensor to scalar contribution to the 
quadrupole: /C\ , also written as T/S. If the 



normalization of inflationary models with tensors, 
and treatment of a wider class of models, can be 
found in ( Bunn, Liddle fc White 1996[ ). For a short 
discussion of the expectations for the amplitude 
of the tensor spectrum from the point of view of 
particle physics models see ( Lyth 19*9^ ), and for 
a review of inflaton potential reconstruction and 



further relations between observables see ( Lidscy ct 



inflationary model is specified then this quantity al. 1996 ) 



is calculable, and is related to the tensor spectral 
index. We shall treat two cases here, that where 
T/S — (corresponding to models with a "low" 



scale of infiation, e.g. Adams et al. 1993, Lyth fc 



Fig. [l^ shows the COBE-normalized value of as 
for a family of flat CDM models with fio = 1- While 
standard n = 1 CDM predicts an unacceptably large 
value of (78 (unless the Hubble constant is extremely 
low) , even a small amount of tilt is sufficient to bring 
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Fig. 14. — The power spectrum normalization for 
scalar perturbations versus flBh? for two flat models. 
Solid lines indicate lO'^^Cio for the best-fitting scale- 
invariant CDM model while dashed lines indicate Sb_- 
The sensitivity of the tensors is similarly small. 

the prediction in line with the observations. 



5.3. Flat Models 

There are several additional complications in 
the case when flo ^ I. Let us first consider models 



Fig. 15. — The COBE-normalized small-scale 
fluctuation amplitude erg is plotted for a family of 
critical CDM models. Solid lines are models with no 
tensor contribution, and dashed lines have T/S = 
7(1 — n). The hatched region shows the observed 
value of fjg (see text). 

However, the g{Qo)/flo behavior is not the 
only effect which occurs in low-i7o universes. Due 
to the fact that the potentials decay at za, there 
is another contribution to the large-angle CMB 
anisotropy measured by COBE. On large angular 



scales, i.e., ^ < 10 (Kofman & Starobinsky 1985, Hr 



with vanishing spatial curvature, but low flo- So 



we assume that there is a contribution from a 
cosmological constant which restores spatial flatness: 
= 1 — r^o- In ACDM models the fluctuations stop 
growing when z = ^ (^o ^ ~ 1)^/^ for fio ^ 1, so 
the overall growth from z ^ 10^ until the present is 
suppressed by (e.g. Carroll et al. 199^ ) 



White 1996 ), the blueshift of a photon falling 



-n 



1 

70 



209^2 
140 



140 



(28) 



into a potential well is not entirely canceled by a 
redshift when it climbs out, the potential having 
decayed during transit. This leads to a net energy 
change, which accumulates along the photon path, 
often called the Integrated Sachs- Wolfe (ISW) 
effect to distinguish it from the more commonly 
considered redshifting which has become known as 
the Sachs- Wolfe effect (both effects were considered 



in the paper of Sachs & Wolfe 1967). Fig. n2 shows 



In addition, the potential fluctuations are reduced 
by f^o- Thus in terms of the power spectrum, 
P{k), we expect for fixed COBE normalization 
that P{k) oc (g(f^o)/f^o)^ ~ il^^-^'^, as has been 



pointed out by (Peebles 1984, Efstathiou, Bond & 



White 1992). Hence for a fixed COBE normalization 



the matter fluctuations today are larger in a 
cosmological constant model than a critical density 
model. 



the relation between the CMB power spectrum 
normalization Cio and the matter power spectrum 
normalization Sa- Note that the g(flo)/^o scaling 
of (5h is fairly accurate for ACDM models where f^A 
domination occurs late, but less accurate for open 
models where curvature domination occurs early. 

A fit to the four-year COBE data for flat models 
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Fig. 16. — Contours of ag are plotted for a family 
of flat CDM models, with n = 0.8, 1.0 and 1.2. 
Contours are spaced by 0.25 with the thick solid 
contour indicating as — 1. In all cases fie/i^ — 0.015 
and we have used the small-scale approximation to 
the transfer function. 

gives the horizon-crossing amplitude 
lO^dnin^no) = 1.94 17n 



-0.785-0.05 In On 



^° exp [an + bn^] , 
(29) 

where n — n — 1, a = —0.95 and b = —0.169 with 
no gravitational waves, and a = 1 and b = 1.97 
with power-law inflation gravitational waves. This 
flt works to better than 3% for 0.2 < JIq < 1 and 
0.7 < n < 1.3, and again the statistical uncertainty 
is 7%. For the power-law inflation case the fit is 
restricted to n < 1. For Hq > 0.3 there is less than a 
5% correction to the g{no)/il.o scaling from the ISW 
component (see Fig. [l^ ). 

Going to smaller scales. Fig. |l^ shows the 
COBE-normalized value of tig for a variety of flat 
CDM models. 



5.4. Open Models 

If we again consider models with low fig but 
now do not introduce a cosmological constant to 
restore spatial flatness, we find that keeping track 
of the ISW effect is extremely important. Even 
though the calculations become more technically 



challenging in an open model, it turns out that the 
largest effect is simple to understand. Since full 
matter domination occurs so far after last-scattering 
and curvature domination occurs so early, the 
gravitational potentials are almost always evolving 
(<& is only constant when the universe is fully matter 
dominated). For this reason, for the JIq of interest 
in structure formation, the ISW effect dominates the 
Sachs- Wolfe effect over the entire range of angular 
scales. Since the temperature anisotropy is enhanced 
by the ISW contribution, the fluctuations in the 
matter required to fit COBE are lowered in an open 
model (see Fig. p^ . 

There are additional complications due to the 
curvature selecting a scale in the universe. This 
means that there is some ambiguity in defining 
the initial power spectrum. We can obtain some 
guidance from inflationary open-universe models 
which predict a nearly scale-invariant spectrum of 
curvature (or gravitational potential) perturbations, 
related to density perturbations through the 
Poisson equation. The density fluctuation spectrum 
corresponding to a power-law curvature perturbation 



is ( |Lyth fc Stewart 1990| or see [White 
and references therein) 



Bunn 1995 



with g2 = k^/{-K) 



P{k) cx 

1 and 



(30) 



-K 



In specific open-universe inflationary models 
based on bubble nuclcation there are corrections 
to this assumption. First P{k) is modified by 
factors which can enhance or suppress power as 
q ^ 0, typically a factor between coth(7r(7/2) and 
tanh(7r(7/2). Second there is a spectrum of "discrete" 
modes with < 1 which can add power at small 
£, and the fluctuations from the bubble wall can 
give rise to perturbations in the open universe also 
(for a readable introduction and overview of open 



inflationary models, see Cohn 1996). Furthermore, 
no calculations have yet been done to asses how 
good a power-law the initial spectrum is supposed 
to be in the individual models. Finally there is the 
possibility that a spectrum of gravitational waves 
is produced, which should occur if the energy scale 
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of inflation is close to the GUT seale. In short, the 
theoretical predictions for the power spectra of such 
models are still quite uncertain. 

To avoid detailed modeling of a particular 
inflationary scenario, we will deal with simple 
power-law spectra of density perturbations here. 
For the range of fio of interest the coth(7r(7/2) and 
tanh(7r(7/2) terms do not affect the normalization 



(Yamanioto & Bunn 1996). From the point of view 



of providing a fit to the large-scale structure and 
CMB data, it appears that Iow-Qq models with 
a slight spectral tilt n ^ 1.1, no super-curvature 
or bubble wall modes and no tensor component 
wiU fit the data best ([White fc Silk 1996| ), thus 
this assumption combines simplicity with current 
prejudice. A fit to the four-year COBE data for such 
models gives the horizon-crossing amplitude 

10^(5H(n,f^o) = 1.95 1]-"-35-0191"f^«-"17n (31) 

exp [-n - 0.14n^] . 

This fit|^ works to better than 3% for 0.2 < < 1 
and 0.7 < n < 1.2, and again the statistical 
uncertainty is 7%. This (5h can be used in Eq. (ESh 
for k » V—K, but note that the behavior of Eq. (pi] ) 
outside the range 0.2 < f2 < 1 is pathological. 

Note that the large difference in the scaling with 
r^o between these models and flat models means that 
the COBE normalization can distinguish open and 
flat low-density models. The addition of tensor or 
supercurvature modes would serve only to further 
suppress the power on small scales in the open 
models, exacerbating the difference. This means that 
open models can be distinguished from flat models 
using the COBE normalization. Fig. |l^ shows the 
CO-B_E-normalized value of erg for a variety of open 
CDM models. For low flo the scale-invariant models 
whose shape flts the galaxy clustering data tend to 
underpredict the cluster abundance (i.e. erg) which is 
the basis of the claim above that models with n > 1 
and no "extra" CMB fluctuations are preferred. 



'After this paper was submitted (Gorski et al. 1996t) also 



considered open CDM models fit to the 4-year data. Where our 
results overlap there is excellent agreement: typically ~ 1% in 
(Tg for the range of Qq of interest in structure formation. 
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Fig. 17. — Contours of erg are plotted for a family 
of open CDM models, with n = 0.8, 1.0 and 1.2. 
Contours are spaced by 0.25, increasing to the top 
right, with the thick solid contour indicating ug = 1. 
In all cases n,Bh^ = 0.015 and we have used the small- 
scale approximation to the transfer function. 

Typically cosmological models are assumed 
either to be spatially flat or to have vanishing 
cosmological constant. However, there is no reason 
in principle not to consider models with both 
curvature and A. The COBE normalization for 
scale-invariant models with arbitrary Qq and Q\ is 
fit by 

10^(5h = 2.422 - 1.166e"° + O.SOOe"^ + 3.780f7o - 



2.267r2oe"'^ 
3.392nAe"° 



0.487r2g -I- 0.56mA + (32) 



.568^7o^2,^ 



i.o8or2i 



to an accuracy of 5% for 0.2 < fig ^ 1-6 and 
< f^A < 0.8. (For a discussion of such models. 



especially the closed ones, see White &: Scott 1996b 



5.5. Going to smaller scales 

Turning the COBE normalization at the horizon 
scale into information about clustering of the mass 
on smaller scales involves the transfer function T{k). 
Several popular fitting functions exist for T(fc) 
in CDM and mixed dark matter (MDM) models. 
However, in order to exploit all of the accuracy of the 
COBE normalization, care must be taken with T{k). 
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Fig. 18. — A comparison of the numerically evaluated 
transfer functions (solid squares and open diamonds) 
with the small-scale approximation discussed in the 
text (lines). Note that even for extreme parameters 
the approximation is quite good around k ^ 
0.2/iMpc^^ where erg receives its main contribution. 



which is accurate on small scales. We use the fitting 



function of (Bardeen et al. 1986 



tCDM 



ln(l + 2.34g) 



2.34g 
3.89fl 



(33) 



[1 

(5.46g)3 



+ (16.1g)2+ 
(6.71,)^]^^/^ 



with q = k/hr, and F given by ( Hu fc Sugiyama 1996| , 
Eq. D-28, E-12). As we show in Fig. |l8| this works 
well even in extreme cases, if one averages over 
the small ripples inherent in the large ^Ib/^q 
transfer function. The deviation for k > 0.1/iMpc~^ 
is less than a few percent, though at larger 
scales it can be significant. The approximation 
F — r^oft. exp(— — Hb/^q) of ( ^ugiyama 1995| ) 
works relatively well at large scales for low fis /^lQ 
but can lead to unacceptable errors in T{k) outside 
of this range. 



As pointed out by^ (Liddlc & Lyth 1993, Peacock 
fc Dodds 1994| ) different parameterizations of the 
transfer function can differ by relatively large 
amounts. In addition, to obtain accuracy to better 
than the 10% level, the effect of baryon damping 



must be included ( Peacock fc Dodds 1994 , Sugiyama 



1995 



Hu fc Sugiyama 1996, Ma 1996, White et 



al. 1996) 



The MDM power spectrum can be 
obtained from the CDM power spectrum by 



multiplying by an additional factor (Pogosyan & 



Starobinsky 1995| , [Liddle et al. 19964 |Ma 1996| ). 
However, no fully satisfactory parameterization of 
the CDM T{k) has been pubhshcd. That is to 
say, no fitting function which is accurate to ^ 1% 
over the range 10~'^hMpc~^ < k < l/iMpc~^ for a 
wide range of baryon densities (the scaling with the 
matter density and Hubble constant is Cloh so can 
be included exactly). 

For the purposes of this paper, we shall compute 
only (jg. Thus we need an approximation to T{k) 



^The formula for T(k) in 



Peacock & Dodds 1994 contains 



a typographical error of which readers should beware. 



6. Implications for Large-Scale Structure 

In principle, COBE lets us constrain models 
of large-scale structure in two independent ways: 
we can use both the shape of the power spectrum 
and the normalization. In practice, though, the 
shape constraints turn out to be quite weak for most 
popular models, as we saw in §^ and Fig. |ri|. We 
therefore focus on the implications of the power 
spectrum normalization. 

Table I gives values of the small-scale density 
fluctuation amplitude as for a selection of models. 
The first line makes it clear that COBE-normalized 
"standard" CDM {n = 1, h = 0.5, ^Bh'^ = 0.0125) 
predicts significantly too much small-scale power 
and is therefore ruled out. However, any of several 
slight changes to the model can easily resolve this 
inconsistency. Perhaps the simplest solution is 
a slight tilt to the power spectrum. Inflationary 
models typically predict spectral indices slightly less 
than one, and a value of n of 0.8 or even less is quite 
natural in such models. 

We have discussed the difference in 
normalization between the open and ACDM 
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n 


h 




CT8 


standard CDM 


1.0 


0.0 


0.0 


1.0 


0.50 


0.0125 


1.22 


tilted CDM 


1.0 


0.0 


0.0 


0.8 


0.50 


0.0250 


0.72 


MDM 


1.0 


0.0 


0.2 


1.0 


0.50 


0.0150 


0.79 


ACDM 


0.4 


0.6 


0.0 


1.0 


0.65 


0.0150 


1.07 


Open CDM 


0.4 


0.0 


0.0 


1.0 


0.65 


0.0150 


0.64 


Low h CDM 


1.0 


0.0 


0.0 


1.0 


0.35 


0.0150 


0.74 



Table 2: Small-scale normalizations for various models. 



models in the previous section. By reducing 
the density we reduce the small-scale fluctuation 
amplitude as measured, say, by erg. Since cs is 
typically too large in "standard" critical-density 
CDM models, such low-density models tend to fare 
better. As we saw above, the suppression of cts 
is greater in open models than in ACDM models. 
These results are quantified for some specific models 



depends upon the details of that model, but the 
above argument is general enough that we may 
expect it to hold in a large class of models. Since 
the adiabatic models predict roughly the right 
amount of power when compared to large-scale 
structure measurements on smaller scales, the 
generic isocurvature models predict too little. These 



issues are dealt with in more detail in White & 



in Table 2 



Scott 1996a and references therein. 



Another obvious impact of the COBE 
normalization is that it differentiates between the 
classes of theories based on adiabatic fluctuations 
(e.g. inflation) and those based on isocurvature 



fluctuations (e.g. topological defect models). In 



Discussion of particular models of structure 
formation is beyond the scope of this paper. 
Analyses based on the four-year COBE data have 
been performed for critical-density models ( White| 
et al. 199e| ), ACDM mo dels (|Liddle et al. 1996b|), 



the adiabatic models, the fluctuations in all the 



constituents have the same sign, so an overdensity 



and open CDM models ( |G6rski et al. 1996b| , |White| 
fc Silk 199^ ). The main difference between the 



of photons is also an overdensity of baryons and 
cold dark matter. This means that photons from 



four- year results and those based on analyses of the 



two-year data (e.g., Gorski et al. 1995, Stompor 



overdense regions (which are hotter than average, Gorski & Banday 1995, White & Bunn 1995, See 



n ~ T^) must climb out of larger than average 
potential wells, in the process losing energy. In 
isocurvature models it is "cold" photons which 
must climb out of such potential wells. Thus, 
in the adiabatic case the "intrinsic" temperature 
fluctuation and the gravitational redshifting effect 
partially cancel, while in the isocurvature case they 
add. This leads to smaller temperature fluctuations 
for a given matter fluctuation in the adiabatic 
case, or conversely larger matter fluctuations when 
normalized to the temperature fluctuations observed 
by COBE. 

The relative normalization of the matter and 
temperature fluctuations in any particular model 



also White & Scott 1996a and references therein.) 
is a reduction of ~ 10% across the board in the 
COBE normalizations. Part of this reduction is due 
to Galactic contamination in the two-year analyses: 
the two-year data were typically analyzed with a 
straight 20° Galactic cut, as opposed to the slightly 



more extensive "custom cut" (Bennnett et al. 1996) 
that has been applied to the four-year data. 



7. Conclusions 

The four-year COBE DMR sky maps 
provide some of the most stringent constraints 
on cosmological models. Because the data are so 
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powerful, and because they are the best large-angle 
CMB data we are likely to have for the next few 
years, it is important to extract as much information 
as possible from them. In particular, when using 
COBE to normalize models for large-scale structure 
formation, one should perform a full fit to the 
data, rather than using a single number such as the 
root-mean-square temperature fluctuation smoothed 
on some angular scale. 

We have provided normalizations and likelihoods 
based on the four- year DMR data for a broad class of 
cosmological models. In conjunction with small-scale 
measurements of the matter power spectrum such 
as as, these COBE normalizations allow us to place 
significant constraints on the parameters of various 
cosmological models. Furthermore, CMB data on 
smaller angular scales is improving rapidly in both 
quality and quantity. By comparing these data with 
the estimates of the COBE power spectrum, it will 
soon be possible to severely constrain models on the 
basis of the CMB angular power spectrum alone. 

We wish to thank W. Hu, A. Liddle, D. Scott 
and J. Silk for useful conversations and E. Wright 
for supplying the software used to make Fig. |[ EFB 
was supported by NASA. 
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Fig. 5. — The four-year COBE DMR data are plotted in Aitoff projection. The top panel shows the raw 
pixel data. The middle panel is the result of filtering the data by removing all but the first 400 signal-to-noise 
eigenmodes, and the lower panel shows the result of applying a Wiener filter. 
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